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1 Introduction 



Quicksort was chosen as one of the 10 most important algorithms. Quick- 
sort serves also as a challenging random divide-and-conquer algorithm for a 
mathematical analysis. Starting with the worst case, the best case and the ex- 
pected running time [7] we know nowadays much finer results on the limiting 
distribution, the existence [13] via martingale methods and a characterization 
[H] as a stochastic fixed point. The running time for many versions, actu- 
ally all versions I know of, can be analyzed [TUl E] by the tools (contraction 
method and Weighted Branching Process [18J ) invented for Quicksort. 

Around 2004 Conrado Martinez [8] came up with the algorithm Partial 
Quicksort, a mixture of Quickselect and standard Quicksort. Given as input 
a list of n numbers, find the numbers between the fc-th and the /-th smallest 
in order. A special case is Quicksort on the fly, where the algorithm tries 
to finish always the left most list. The output is first the smallest then the 
second smallest element and so on as an incoming stream. 

What can we say about the running time up to see the /-th smallest? 
Martinez gave an explicit formula for the expected time (jl]) and showed 
an asymptotic result. This was an indication for a possible distributional 
result. In 2010 Martinez and Roesler showed convergence of one dimensional 
marginals to a limit, which required already some effort. This paper is on 
the convergence of finite dimensional marginals and, moreover, the existence 
of a limiting process for Quicksort on the fly with cadlag paths in D. Most 
of this work is in the dissertation of Ragab |12j . 

The existence of the Quicksort process, Theorem [IJ uses a specific con- 
struction via a Weighted Branching Process (WBP), see section 2 for the 
WBP and section 3 for the existence. The construction uses a specific se- 
quence of rvs (R n ) with values in D and the convergence is a uniform con- 
vergence of paths in an L2 sense. We could take the supremum norm in D, 
since the approaching sequence has the same(!) jump points path wise as 
the limit. 

For the convergence of the discrete processes to the limit we again use a 
WBP on the binary tree. Technically we embed the Quicksort process and 
the discrete version all at the same time into a WBP on the binary tree with 
very specific rvs. An additional parameter n G IVU{oo} stands for the input 
list length, n = 00 for the limit. The basic idea is to find appropriate rvs on 
the binary tree such that we have a limit (of R n ) as n — > 00. The main result 
of this paper is Theorem |5]on weak convergence of processes. This embedding 
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seems to be the right object in order to obtain stronger convergence results. 
Notice the interplay of rvs and distributions and that we have to take nice 
versions of rvs in order to obtain (only) distributional results. We face a 
similar interplay (and technique) in the analysis of Find [31 [T71 16]. 

After this rough idea of the abstract embedding of this paper into the 
existing literature and of its contribution, we shall give some more details. 

Quicksort on the fly sorts the input of n distinct numbers and returns first 
the smallest, than second smallest and so on at the time of identification. The 
output is a stream of data in time. We are interested in this flow of data and 
will analyze the difference to the average behavior. 

Like in Quicksort jH |5], choose with a uniform distribution a pivot from 
the input set S of distinct numbers, split S into the set S < of strictly smaller 
ones than the pivot, the pivot and the set 5> of strictly larger ones in this 
order. Then continue recalling Quicksort independently always for the left 
list S < . If S < is empty continue with the next leftmost sublist recalling the al- 
gorithm. If S < consists only of one element, output this number immediately 
and continue with the next leftmost list. 

Let X(S, I), I = 1, 2, . . . , \S\ denote the number of comparisons made up 
to the event when the /-th smallest number in the set S is determined. The 
interpretation of X(S, I) is as proportional to the time of publishing the l-th 
smallest. The rv X = X(S, \ satisfies the recursion 

(X(S, l))\% = (\S\ - 1 + JWX(S<, + 1 W (X(S<, / - 1) + X(S >: I - /)))£! 

(1) 

for \S\ > 2. Here / = I(S) = \S<\ + 1 denotes the rank of the pivot and 
has values in {1, 2, . . . , n} with a uniform distribution. The rv I — I(S) is 
independent of all X-rvs on the right of equation ([T]). The rv X(S < ,-) is 
if S is the empty set or has only one element. Otherwise the rv satisfies 
a similar recursion, where I(S<) is independent of I(S). (Mathematically 
correct: 1(A), 7^ A C S are independent with 1(A) a uniform distribution 
on A and the X are recursively defined.) Continuing this way we find the 
distribution of X(S, \ S\) as the Quicksort distribution sorting S by standard 
Quicksort. 

The equation (Q4) determines the distribution of (X(S,l))\J 1 via the dis- 
tribution for smaller sets. The distribution depends on S only via the size 
n — I S\ of S. (Prove this by induction on n and notice the distribution of I(S) 

depends only on the size n of S.) For that reason we write X(S, I) = X(\S\, I). 
For notational reasons as above, which corresponds nicely to the interpre- 
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tation, we use the boundary conditions X(n, 0) = for all n G IN and 
X(0, •) = X(l, •) = 0. Then the above recursion writes for n > 2 

(X(n, /))r=i = (n-l+l l<In X\l n -l, l)+l l > In (X 1 (I n -l, I n -1)+X 2 (n-I n , l-I n )))? = 

(2) 

and determines the distribution of (X(n, l))f =0 , n G IV, by the previous ones. 
The rvs I n , (X l (j,k)) J k=0 , i — 1, 2, 1 < j < n are independent. The rv I n 
has values in {1, 2, ... , n} with a uniform distribution, X l (j, •) has the same 
distribution as X(j, ■) given by recursion. Notice X(n,n) — X(n, n — 1). 

In our version of Quicksort we use internal randomness by picking the 
pivot with a uniform distribution. Like in standard Quicksort, we could 
instead of internal randomness also use external randomness. Choose as input 
an uniform distribution on all permutations 7r of order n and pick as pivot 
any, for example always the first in the list. Now X(tt, •) is a deterministic 
function depending on the input tt. Seen as a rv with random input n we 
face the same distribution as with internal randomness. The main advantage 
using internal randomness is that X(tt, •) has the same distribution for every 
input 7r of the same size. Alternatively we could start with n iid random 
variables uniformly on [0, 1] as the input and choose as pivot always the first 
element of the list. The algorithm itself would be deterministic, the running 
time is a rv via the input of an iid sequence and has the same distribution 
as our X(n, •). 

From equation fl2]) we obtain a recursion for the expectation a(n, I) = 
E(X(n,l)), n > 2, 1 < I < n 

1 1 1 " 

a(nJ)=n-l + -J2(a(j-l,j-l)) + a{n-j,l-j)) + - V a(j-l,l) (3) 

j=l j=l+l 

The term a(n,n) is the expectation of sorting n numbers by Quicksort. All 
a(n, I) are uniquely defined by the above equations and the starting condi- 
tions. Martinez [8] obtained the explicit formula 

a(n, I) = 2n + 2(n + l)H n - 2(n + 3 - l)H n+1 ^ - 6/ + 6 (4) 

1 < / < n G IN. Hj denotes the j-th harmonic number Hj = Yll=i ~- (Notice 
a(0, 0) = = a(l, 0) = a(l, 1) but in general the formula (T4J) is not EX(n, I) 
for n > 2 and 1 = 0.) For Quicksort we obtain the well known formula 
a(n, n) = 2(n + l)H n - An. 
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Martinez argued with Partial Quicksort PQ(n,l), which for fixed n,l 
sorts the I smallest elements of a list. For more results and versions of it, 
optimality and one-dimensional distributions for Partial Quicksort see |10j . 
The Quicksort on the fly process is an extension of Partial Quicksort in the 
sense of taking I as a time variable and considering processes. We find first up 
to the I — 1-smallest elements, then continue this search for the l-th smallest, 
then I + 1-smallest and so on. 

Now we come to the distribution of the process X(n, •) in the limit, where 
X is defined via ([2]). This includes the question, how much X{n, I) differs 
from the average a(n, I). Define the rvs 

I = X(n,l)-E(X(n,l)) 
n n n 

for I — 0,1, ... ,n. These rvs satisfy the recursion, n > 2 
(Yn(-)) = (CMQ + h^J—Yl^-t-) 

n n n — l n 

C(n,l,i) = — (n — 1 — a(n, I) + I^Aaii — 1, 1 + 1) 
n 

+ l i+l=l a(i - 1, % - 1) + lz>i(a(i - 1, i - 1) + a(n - i, I - i)) 
C(n,0,i) = 

for I — 1, . . . ,n — 1. Notice Y n is well defined, with the help of the indicator 
function, and there are no boundary conditions besides Y = = Y\. 

We extend the process Y n nicely to a process on the unit interval [0, 1] 
with values in the space D = D[0, 1] of cadlag functions (right continuous 
functions with existing left limits [TJ) on the unit interval. This can be done 
by linear interpolation or a piece wise constant function. We shall use the 
extension 

Y n (t) := Y n {^). (6) 
The process Y n with values in D satisfies the recursion, we use U n = — 

Y n £ (C(n,[nt\,I n ) + l^J-^zlYl^-^-Al) 
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for n G IV. In short notation 

Y n = <f n (Un, (Y£) k<n , (Y*) k<n ) (7) 

for a suitable function ip n . 

If n — > oo then C/ n converges in distribution to a rv 17 with a uniform 
distribution. We might expect that the process Y n converges in some sense 
to a limiting process Y = (F(t)) te [o,i] with values in D satisfying something 
like the stochastic fixed point equation 

y = (i t< ^y 1 (^) + w(^ 1 (i) + (i-t/)^ 2 (^)) + c'(t ) [/)) i 

Y E mY\Y 2 ) (8) 

for a suitable function if. The rvs Y 1 ,Y 2 ,U are independent. Y 1 and Y 2 
have the same distribution as Y and U is uniformly distributed on the unit 
interval [0, 1]. The cost function C = C(-, U) is given by 

C{t,x) := C{x) + 21 t<x {-l + x + (1 - t) ln(l - t) - (1 - x) ln(l - x) 
-(x-t)ln(x-t)) (9) 
C(x) := 1 + 2xlnx + 2(l - x) ln(l -a;) (10) 

and is the limit of C(n, l n , i n ) as n — > oo with ^ — > n t, ^ — y n x, Proposition 

m. 

Our first major result, Theorem states the existence of the F-process 
with values in D. 

Theorem 1 Let U v , v G V, be iid rvs with a uniform distribution on [0, 1] 
and V be the binary tree. Then there exists a family Y v , v G V, of rvs with 
values in D and all of the same distribution satisfying almost surely 

Y v = <f(U v ,Y vl ,Y'" 2 ) (11) 
Y v (t) = l t<uvU v Y v \^) + l t > uv {U v Y v \l) + {l-U v )Y^ 

simultaneously for all t G [0, 1]. 
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The above equation fill I) for rvs implies the distributional equality and Y = 
Y® satisfies the fixed point equation (JSJ). A specific family Y v is explicitly 
given in the paper. We call Y® the Quicksort process and its distribution the 
Quicksort process distribution. 

Our second major result, compare Theorem [8] and Corollary |9l states the 
weak convergence of Y n to the Quicksort process constructed above. 

Theorem 2 All finite dimensional distributions ofY n without the coordinate 
zero converge to those of the Quicksort process Y constructed above. 

The first result is a probabilistic result, while the second is a measure 
theoretic one. We obtain both results via the Weighted Branching Process 
[To] and an explicitly given nice family of processes Y£ indexed by n e IN 
and the binary tree. Basically we use the splitting [/-rvs for the F-process 
also for the F n -process. 

Inspired by previous work on Find [6] one might choose the pure mea- 
sure theoretic approach by first showing the convergence of finite dimensional 
distributions of Y n by the contraction method obtaining a limiting measure 
on iR' 0,1 '. Then show the tightness pQ of the sequence of measures in order 
to obtain a limiting measure with outer measure 1 of D. The measure re- 
stricted to D provides the desired distribution. There exist rvs satisfying (]§]). 
(Statement (1TT]) is slightly stronger.) Neininger and Sulzbach [TTJ study this 
approach in more generality using the Zolotarev metric. 

We prefer here a probabilistic approach using rvs instead of measures. 
Actually we formulated a (seemingly) stronger statement, the existence of 
a family Y v of rvs with values in D and indexed by the binary tree, which 
satisfies (JS} almost everywhere as rvs. For the construction we used a.e. 
convergence of appropriate rvs in supremum metric. A similar probabilistic 
approach via the Skorodhod metric was used [3] for Find, the first process 
analysis of a stochastic algorithm. 

Also for Quicksort we find the two approaches in the literature with dif- 
ferent types of results. The probabilistic approach by Regnier [13] provided 
a limiting rv via an L 2 -martingale. The measure theoretic approach via the 
contraction method [T5] and the backward view characterized the limiting 
distribution as unique solution of a stochastic fixed point equation. Via the 
approach of Weighted Branching Processes one can construct a family of 
tree indexed rvs satisfying the stochastic fixed point equation as rvs. (An- 
other L 2 -martingale R n = Yli^vC v , different from Regnier's, is the key to 
convergence.) 
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We stated the results for processes with values in D. This is because 
D is preferred to the space E of left continuous functions / : [0, 1] — > M 
with existing right limits. Both spaces are isomorphic with respect to the 
topological, probabilistic and algebraic structure. (The easiest argument is 
via the map [0, 1] 3 x i— > 1—x.) The space D is preferred in probability theory 
(PP) and this is our main argument for using the space of cadlag functions. 
This explains the motivation for our normalization in fl5J). 

Using the same arguments, the path wise left continuous version of Y 
is a solution in E of the corresponding equation fllip in E. For the discrete 
setting and interpretation it seems on the first view more natural to use left 
continuous functions. If we know the Z-th largest and are only interested in 
that one, we do not have to look to the future, we are done. The D and E 
versions are mathematically equivalent. 

2 The Weighted Branching Process 

We introduce Weighted Branching Processes (WBP) in general and will spe- 
cialize to Quicksort examples on the binary tree. 

Let V be the Ulam-Harris tree IN* = U^ =0 lN n of all finite sequences of 
natural numbers including the empty sequence denoted by the empty set 0. 
(By convention W° = {0}.) V is a rooted tree with root and the edges 
(v,vi), v G V, i G IN in graph theoretical sense. We use the standard 
notation v = (fi, 1*2,^3, . . . ,v n ) = v±V2 . . .v n for a vertex v G V of length 
n — \v\. We skip the empty set in the notation whenever possible. We use 
V n for v G V of length n and V< n , V <n appropriate. 

A weighted branching process (WBP) is a tuple (V, (G, *,H,Q), (T, C)). 
(T, C) is a random variable with values in G M x H on some probability space. 
(Actually we need only this distribution for the description, but prefer here 
a probabilistic language.) (G, *) is a measurable semi group (* : G x G — > 
G, (g,h) i — y g * h associative and measurable) with a neutral element e 
(Vp G G : e*g — g — g*e) and a grave A (Vp G G : A*g = A = g* A, once in 
the grave forever in the grave). (G, *) operates left on H via : G x H — > H. 
if is a measurable space and G x H endowed with the product cr-field. 

Let (T V ,C V ), v G V, be independent copies of (T,C) on the same prob- 
ability space (Q,A,P). The interpretation of C v is as a cost function on a 
vertex v and where T = (7\, T 2 , . . .), is a transformation (weight) on the 
edge (v,vi). The interpretation of G is as a map from H to H. If H has an 
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additional structure then we might enlarge G to have the induced structure. 
Examples are if is a vector space or an ordered set and the extended G will 
be a vector space or ordered set via the natural extension. 

For a WBP define the path weights : ft — > G, v, w G V on paths 
(v, vw) recursively by = e and 

TV _ r V j, rpvw 
^wi — L » * J i ■ 

One of the basic assumptions of a WBP is the independence of families, 
but arbitrary dependence within a family. Let A n be the cr-field generated by 
all Ti,C v , for v G V <n ,i G IV. Then L v w is measurable with respect to A\ vw \ 
and C v is independent of .Au. We will use this many times in the sequel. 

For a WBP without costs we write also (V, (G, *),T). We drop the vertex 
whenever possible, e.g. L v = L\ or later i? = R, S 10 = S. The interpretation 
of L v w the grave is, we can not see the path from the root v to vw. Mathe- 
matically, no value grave connected to the path (v, vw) will ever contribute, 
like L" W C VW will be in our examples if L v w = 0. By this construction we shall 
use freely other trees like m-ary trees {1,2,..., m}* = U ne ^v {l, 2, . . . , m} n 
of all finite sequences over the finite alphabet 1,2, ... ,m in an appropri- 
ate sense. For the m-ary tree we take T = (Ti, T 2 , . . . , T m ) instead of 
T = (Ti, T 2 , . . . , T m , A, A, . . .) on the original tree IN*. 

For the next sections we need the following two examples. Although they 
provide known results the novelty is the line of arguments, which can be 
generalized and which are the key for the Quicksort process. 

Example 1: Quicksort distribution [T3]: Here we show mainly the exis- 
tence of the Quicksort distribution via an embedding into the WBP. Consider 
the WBP (V = {1, 2}*, {JR, ;M,-), {{U, 1-U), C{U))) with U has a uniform 
distribution and 

C(x) = 1 + 2x\nx + 2(1 - x) ln(l - x) 

as in 0. 

G is the multiplicative semi group M with the neutral element e = 1 and 
the grave A = 0. G operates left on H = M by multiplication. Let U v , v G V, 
be independent rvs with a uniform distribution on [0, 1]. Put 

TI = U v , T% = \- U v , C v = C{U V ). 

(For the general WBP with tree IN* we could take T? — A = for i > 3. 
The smaller binary tree is more suitable here.) Since H is an ordered vector 
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space, we extend G with the interpretation of maps to the ordered vector 
space generated by the maps. 

The total weighted cost R m := J2 v &v <m L V C V up to the m — 1 generation 
is an L2-martingale and converges in L 2 and a.e. to a rv Q [14] , The distri- 
bution of Q is called the Quicksort distribution. The distribution is uniquely 
characterized [T4"l [2] as the solution of the stochastic fixed point equation 

Q = UQx + (1 - U)Q 2 + C{U) (12) 

T> 

with expectation 0. Here = denotes equality in distribution. The random 
variables U,Qi,Q 2 are independent, U is uniformly distributed and Q±,Q 2 
have the same distribution as Q. 

By the a.s. convergence of = ^2 weV m L V W C VW the rvs 

Q v := L l° VW ( 13 ) 
wev 

exist and satisfy a.e. 

QV = fJVQVl + JJV^QV2 + QfjJVj 

for every v E V. Of course the distribution of Q v is a solution of f lT2|) and is 
the Quicksort distribution. 

Example 2: Convergence of the discrete Quicksort distributions [14J: 
The original problem concerns the number X n of comparisons to sort n dis- 
tinct reals. We use internal randomness. Then for n E IN 

X n ^n-l + X\_ x + X 2 n _ In 

with I n , X X ,X 2 are independent, I n has a uniform distribution on 1, . . . ,n 
and X},Xf have the same distribution as X{. The boundary conditions are 
X and X\ are identical 0. The expectation of X n is a n = a(n,n) as in fll]). 
The normalized rvs Y n = Xn ~ an satisfy the recursion 

where 

C Ti 1 Q n -|- di—\ ~\~ dn—i 
n{l) ■= 

n 
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We come now to the abstract embedding of this example into a WBP with 
an additional parameter n G IN. Let H be the set of functions h : JN — > M 
and G the set H x G 2 where G2 are the functions g : INq — >■ Wo satisfying 
(7(0) = and g(n) < n for all n G W. The semi group structure is given by 

ffi) * (/2, #2) = (/1/2 o # 2 , #2 o 0i) 
and the operation on H via 

(f,g)Qh = fhog ((/, «?) © fc)( n ) = /(n)%(n)) 

(o denotes composition and we use multiplication on 

The interpretation of (/, g) G G is as a map on f/T, where / is a multi- 
plicative factor and <? an index transformation. The operation * corresponds 
to the convolution of maps on H. Since H is a vector space we may enlarge 
G naturally to a vector space. 

Consider the binary tree V = {1,2}* and let U v ,v G V, be independent 
rvs with a uniform distribution. Let 1% := |~7iC/"] (upper Gauss bracket) and 
define the transformations on the edges (v,vl), (v,v2) by 

J?(n):=i^-1 r 2 (n):=n-F n 
T» : = J») T 2 » := J 2 ») 

and the vertex weight 

C» := CM) 
The rvs _ 

wev< m 

converge as m — )■ 00 a.e. and in L 2 to a limit i2" and satisfy 



R V m = J2 T > &RV ^ +CV 



for me N = iVUjoo}. 

Notice R v m and R v take values in H and R v (n), n G W, is a random 
variable with values in the reals. Notice the connection to the previous 
description of the Quicksort rv. Y n from the introduction 



Y n ^R\n) 
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R v (n) converge for every v G V in L 2 to the rv Q v from the Quicksort 
example [H]. We shall use Q v n = R v {n) in the sequel and drop the root 
whenever possible. 

Example 3: Joint embedding: It is worth while to put the two examples 
together. Use iV = IVq U {00} instead of Wo in the second example and 
incorporate the first example via the value 00. 

3 The Quicksort Process 

In this section we consider the Quicksort process. Let D = D([0, 1]) be the 
vector space of cadlag functions / : [0, 1] — >■ JR (right continuous with existing 
left limits). D is endowed with the Skorodhod topology PQ induced by the 
Skorodhod Ji-metric 

d(f,g) = inf{e > | 3A G A : ||/ -go A||qo < e, ||A — id||oo < e} (15) 

where A is the set of all bijective increasing functions A : [0, 1] — > [0, 1]. We 
use the supremum norm ||/||oo = su Pt 1/(^)1- 

The space (D, d) is a separable, non complete metric space, but a polish 
space [Tj. The cx-field o{D) is the Borel-cr-field via the Skorodhod metric. 
The cx-field is isomorphic to the product cx-field M A PI D where A is a dense 
subset of [0, 1] containing the 1. 

Let F(D) be the space of all measurable functions X with values in D. 
For 1 < p < 00 let F P (D) be the subspace such that 

llXHoo^ := || HXlloollp < 00 (16) 

is finite. Here || • \\ p ,p < 00, denotes the usual L p -norm for rvs. The map 
|| ■ || is a pseudo metric on jF p (D). Let ~ be the common equivalence 
relation 

X ~ Y P{X ^Y) = 

and F P (D) be the set of equivalence classes [X] = {Y G F(D) \ X ~ Y} 
intersected with J-" P (D). Then it is well known 

Proposition 3 (F P (D), || • ||oo, P ) is a Banach space for 1 < p < 00 with the 
usual addition and multiplication 

[/] + [<?] = [/ + 9], C[f] = [cf}, ||[/]|| p ,oo=||/||oo, P . 
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In the following we will be careless and will not differ between functions and 
equivalence classes. 

Let (G, *) be the semi group G = D x where consists of increasing 
functions D 3 g : [0, 1] — > [0, 1] and the semi group operation * is 

(A, ffi) * (/2, 92) ■= (/1/2 o g u g 2 o g x ). 

(G, *) has as neutral element (1, id), the function identically 1 and the iden- 
tity, and the grave is (0, id). G operates left on H = D via 

{f,g)Qh:= fhog. 

The tuple (/, g) G G has the interpretation of a map Mf j9 from H to H 
acting as 

M fj ,(h)(t) = f(t)h(g(t)). 

The first coordinate / acts as a space transformation, the second coordinate 
g as a time transformation. The semi group structure * is the composition 
of the corresponding maps 

M (/i,si)*(/2,32) = M h,gi M hm- 

(Notice the order of the composition.) Since if is a vector space and M is a 
lattice, we will embed G to maps H H and use freely the induced structure 
+, •, V on G, V denotes the supremum, a G M 

(M fugi + M hm ){h) = M fugi (h) + M hm (h) 

a-(M Lg (h)) = (a-M Lg )(h) 

(M fl , gi V M ht92 )(h) = (M fl>gi (h)) V {M ht92 {h)). 

Let V = {1,2}* be the binary tree and let (U V ,Q V ), v G V, be the 
rvs as in the Quicksort example in the WBP section. Define on the edges 
(v, vi), v G V, i G {1, 2} the edge weights T v = (T£, T%), = (A?, B") with 
values in G and the vertex weights C v with values in H by 

a; = lp,^)!/* s?(0 = ia^ 
^ = i^, 1] (i-cT) 5^) = ov^ 
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C v = C(;U v ) + l [uv>1] Q vl 
where C(t,x) is given in fl9]). Define rv R" m with values in H by 

iC:= Y. L l® CVW ( 17 ) 
wev< m 

for m G JVq. 

Proposition 4 27ie rys satisfy 

2 

no \ T v D^i i Hfi^ 

K m — 2.*, * K m-l + ° 
i=l 

/or a// m G IN, v & V. 

The proof is straight forward and easy since * and + interchange. q.e.d. 

Theorem 5 Let (V = {1, 2}*, (G, *, F, 0), ((21, T 2 ), C)) &e the weighted branch- 
ing process as defined above. Then R" m converges in supremum metric on D 
as m — >■ oo a. e. to a rv R v for all v G V. The family R v , v G V satisfies 

2 

R v = Y,T t v R vi + C V (19) 

2=1 

almost everywhere. For every p > 1 and v G V holds 



8 + 2- 1 /PA; p ||g|| p 

|-Rm||oo||p — \ J (20) 



1 kp 



( 2 \ 1 ^ P 

where k p = ( j and Q is a rv with the Quicksort distribution. 

Proof: Let S" m = W m+l - R v m = E w ev m K © ^ = and 6 r , 

| 2)0 o- Notice b m = ||S'^|| 00i 2 does not depend on the vertex v and 



QV \ rpv QVl 

D m — 2-~i » D m-1 
i=l 

by (USD for all m G IV, u G V. 
• & < llt-i for m G W. 

14 



= EsMil^uUSi^)) 2 + (W(l - t/)^_ 1 ({^)) 2 ) 

< ^((^ll^-xlloo) 2 ) +^((1 - ^H^-illoo) 2 ) 
= (i?(f/ 2 )+^((l-f/) 2 )^_ 1 = ^_ 1 

• b m < (l) m/2 b where b = UCU^a < oo. 
Easy by recursion. 

• R m , m G IV, is a Cauchy sequence in (^(H), || ■ ||oo,2)- 
For n < m < n argue 

n-l 



\Rn ~ Rm\\oo,2 < II ^ ] iSz||oo,2 



l=m 

n-l 

l=m l>no 
/2\ n o/2 

- Mo) ~, /VTTTT ^no^oo 



1- VV3 

] H'S'mlloo) = ^ ] || || S m || oo ||l < ^ ] || 1 1 1 1 cxa ||2 < 00 



771 



Let i? be the limit of the Cauchy sequence R m . R(t) is the point wise 
limit of E mG w S m{t) for all t G [0, 1]. 

• R is well defined a.e.. 

Easy by the previous statements. 

• ReF 2 (D). 

Every R m is in ^(D). Since we can write i? = Ylm=o $m a - e - the triangle 
inequality provides the result. 

• Equation ( fl9l) is true. 

The same is true for every R v m , v G V, instead of R m and call R v the limit 
in a.s. sense. Then R v m satisfies f|T8|) and the a.e. convergence provides f|T9j) . 
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For the last statement notice 

HiUoo < \mRl_J V (T 2 i$_l)lloo + Halloo. 

V 

Consider the WBP as above but with cost function the constant C : = 
8 + U v \Q v \, veV, instead of C v . Define 

K ■= E V ^> 

where the symbol \J denotes the supremum. Then ~R" = and ~R" m increases 
in n point wise to R for all v G V and 

R m = T^R m _ 1 V T' 2 u R m __ x + C 

for m G IV. By induction it is easy to show ||-RJJJIoo < ~$m- (Notice HCHoo < 
8.) Then for p > 1 

||-Rto|Ip < ll^l^m-l v T 2 R m _ 1 \\p + ||C|| P 

< (i?|TX-il P + E\T 2 R 2 m _^y /P + 8 + \\UQ\\ P 

< \\Rrn-i\\ p {E\T^ + E\T 2 \p) 1/p + 8 + ||C/Q|| P 

< fc p p m _i|| p + 8 + \\UQ\\ P 

< ±K(s + \\u\\ Mp )< s -±^^ 

im r ° 117? II < 8 + ^ 1/P " g "^ 

\\R\\p = Inn i^ p < - 

m 1 — K p 



q.e.d. 



4 Convergence of the discrete Quicksort pro- 
cess. 

In this section we prove the convergence of finite dimensional marginals of Y n 
to Y. We will define a nice version of Y n such that in L 2 -norm Y n (t) converges 
to Y(t) for every t G [0,1]. This requires to define a nice family (Y^) n of 
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random processes with values in D, indexed by the tree V. We will include the 
Quicksort process via the index oo and consider {Y n ) n ^N , N$ = WU{0, oo}. 
Compare this construction to the examples 3 of the section on the WBP. We 
shall use the general notation of a WBP with binary tree, but now on a more 
general function space. 

Let V = {1, 2}* be the binary tree and 

H = {h : [0, 1] x W ->■ M | Vn G W : h(-, n) G D}. 

Let G 2 be the set of all g : [0,l]x]^-> [0, 1] x No such that g(-,0) = 
0, Vn 6 IV : $ 2 (<?(-, n)) < ^ and $ 2 ((?(-,oo)) = oo where $ 2 denotes the 
projection to the second coordinate. 

Define G = H x G 2 with the semi group operation * 

(fi,9i) * (/2, ^2) := (/1/2 9i, 92 

(Gr, *) has the neutral element (1, id), the function identically 1 and the 
identity, and the grave is (0, id). G operates left on H via 

(f,g)Qh:= fhog. 

The tuple (/, g) has the interpretation of a map Mf t9 from H to H via 
(Mf t g(h))(t,n) = f(t,n)h(g(t,n)). The first coordinate / acts as a space 
transformation, the second coordinate g as a time and index transformation. 
Since H is a vector space and ordered set we will embed G into H H and use 
freely the operations +, •, V 

(M />9 + M fugi )(h) = M Lg (h) + M fugi (h) 

a-(M f>g (h)) = (a-M ftg )(h) 
(M Lg V M fugi )(h) = ((M ; ») V ((M fugi )(h)) 

for a G R. 

Let U v , v G V, be independent rvs with a uniform distribution. Let 
Q^, f G V, be the rvs as in the Quicksort examples 1 and 2 in the WBP 
section. Define on the edges (v,vi),v G V, i G {1,2}, the edge weights 
(transformations) T v = (T" , T^), 7? = Jf)) with values in G and 

the vertex weights with values in H by 

:= \nU'»] K := 5 
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JJ(f,n):=J£-l J%(t,n):=n-F n 

Al(t,n) := l^os^i A v 2 (t,n) : = 1^(1 - ^ 
n ~ n 

\nt\ t — TT" 

B^(t,n) := 1 A — k— i— 5|(t,n):=0V- — 

n u n 

CT{t,n) := C(n, LntJ,/^) + W*^— 



n 



for n G IN and 



oo) := oo J^it, oo) := oo 

Al(t,oo) := ^(t,oo) := W(l - tT) 

^(t,oo):=lA-5- £|(t,oo):=0V*~ ( 



JJV , 1-U V 

C v (t,oo):=C(t,U v ) + l t >uM v Q vl 

t G [0, 1]. AV(t,0) is identically 0. 
Define the rvs R^ 

r>v \ ^ T v r~\ r vuw 

U m ■— 2^ w W 

for m G Wo, u G V. Notice, i?^ takes values in H and i?JJ,(-, oo) is the same 
as the previous Quicksort R° m given in ( |T7|) . The embedding analogous to 
the embedding in example 3 of the WBP section. 

Proposition 6 The rvs R v m satisfy 

2 

R v m = J2 T ?® B %-i + C V (21) 

i=i 

for all m G IV, v G V. 

(By convention oo — 1 = oo.) The proof is easy since * and + interchange, 
q.e.d. 

For every n G IN the function R v m (-,n) converges a.e. as m — > oo to 
R v (-,n) for every v G V. Notice the number of summands increase in m and 
R v (-,n) has only finitely many non zero summands. By induction it is easy 
to show 
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Proposition 7 

y; = ff(-,n) = 5]L;0rb) 

wev 

forn G N ,v G V. 

Proof: The cases n = and n = 1 are easy, since both sides are 0. For the 
induction step use the representation after <^ for Y£ and (121]) for if. We 
show as an example the equality for the first term 

Tl R v (t, n) = Al{t,n)R vl {BZ{t,n),J?{t,n)) 
= Al(t,n)Yj% n) (Bl(t,n)) 



1 



t<m: 



J n 1 V JV J ' 



n 

The rest follows the same line. q.e.d. 
Theorem 8 in i/ie above setting 

n^(t)-nt)ib->^oo0 

/or a// £ G (0, 1] and uG V. 

Proof: Let = if„ + i - i^ and^:= sup te[0)1] sup neW() E((C(t, n)) 2 ). In the 
following let v, w, w G V, m,n G IV , £ G [0, 1]. 

Notice L v w = (A^,, B^, J£) acts as a map on H via (L^ h){t,n) =: 
A v w (t, n)h{Bl(t, n), J v w {t, n)). (We take this as the definition of (A v w , B v w , J v w ).) 
We will use E(L V W ) also as an operator acting on H in the sense E((L V W 
h){t,n) = {{E{Ll)){h)){t,n). We use (L) 2 of an operator via {{Lf){h) = 
(L(h)) 2 . 

Let Am be the a- field generated by all U v , v G V <m . The rv L v w is 
measurable with respect to A\ vw \ and C 1 ' is independent of A\ v \. 

• E(L v w QC vw (t,n) | *4 M ) = 

£K0r(t,n) | A\ vw \) = E(Al(t,n)E(C vw ((B v w ,r w )(t,n)) \ A ]vw] )) = 

• E(L v w C vw (t, n^C^ (t,n)) = {}{oiw^w. 
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For \w\ < \w\ (or vice versa) take the conditional expectation with respect 
to A\ V uj\ and use the previous statement. For \w\ = \w\ use the independence 
of C vw and C vw given A\ vw \ and argue the left hand side is 

= E(Al(t,n)AUt,n)E(C™((Bl,rj(t,n))C^((B^ J*(t,n)) \ A\ vw \) 

= E(Al(t,n)AUt,n)(E(C vw ((Bl,r w )(t,n)) | A lvwl ))(E(C™((B^, J^)(t, n)) \ A\ vw \))) 

= 

• E(S v m S?L) = for m ^ m. 
Square out and use previous results. 

vw\2 



• 

Square out and use previous results. 

• Let b(n,i) = ^V^. Then E(b(n, I n )f < |. 
Notice EI n = ^tll and E {I n f = " ( " +1 f" +1) . 

E(b(n,I n )) 2 = ^( n 2 + 2E(I n ) 2 - 2nEI n -2EI n + 1) 

2 1 12 

3 n 3n 2 ~ 3 

• sup„E we y m ^(snp t ^(t,n)) 2 < (|) m 

Let A^(*,n) = sup t A^(t, n). Notice sup^ sup t A%(t, n) < b{n,I^). The 
recursion for A is 

A" w (t,n) = Av(t,n)A™((Bv,jm,n)). 

We obtain 

sup ]T^(t,n) < sup^(*,n) < 6(n,/«)sup^(*, J?) 
This provides 

E((AU*,n)) 2 ) < E((b(nj:)) 2 SK P E((A-(*,JV(*,n))) 2 \A lvl )) 

i 

< l S upE((AZ(*,j:(*,n))) 2 ) 

By an induction on the length of w we obtain the claim. 
•™P(t, n) mS v m (t,n)) 2 )<a(l) m 
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l.h.s. = E ((K(t,n)Cl((Bl,rj(t,n))) 2 ) 



= E mK(t,n)) 2 E((Cl((Bl,r w )(t,n))) 2 \ A ]vw{ )) 



< a sup E mK(t,n)) 2 ) 




< a £ sup£((^(t,n)) 2 ) 



and t> G V. 

The martingale property follows by E{S^ n {t,n) | .A| v | +m ) = 0, the L 2 - 
statement by the previous claim. 



• E(R v m (t, n) - R v m (t, oo)) 2 ^n^oo for all t G [0, 1] and m G IN. 

Without loss of generality let t be none of the finitely many splitting points 
of the tree vV up to depths m. Estimate n) — i?^(t, oo)|| 2 by the finite 

sum over all w G V <m of the terms \\L V W C vw (t, n)-L v w Q C vw (t, oo)|| 2 . We 
shall show every such term converges to 0. 



l.h.s. = ||^(t,n)C-((K,^)(i^))-^(t,oo)C-((S;,j:)(t,oo))|| 2 

< \\(Al(t,n) - Al(t,oo))C vw ((Bl,rj(t,n))\\ 2 

+\\Al(t, oo)(C™((K, J v w )(t, n)) - C™((Bl, r w )(t, oo))|| 2 

< |K(t,n)-^(t,oo)|| 2 sup sup||C™( S ,i)|| 2 



.sup tjn £((^-iC)M)) 2 <f (|) 




+ J^t, n)) - C™((BZ, rj(t, oo)|| 2 
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The first term converges as n — > oo to since the difference converges to 
a.e. and is uniformly bounded by 1. 

Estimate the second term by the triangle inequality 

< llCC^fct,^))— CC^^) ||2+||ls-(i,«)>c/^ (t ra) -Ci(*, ri,)^^^^— 3L B - (*,oo)<t/— C^^Q 

The first term will converge to 0. Argue U™ converges a.e. to U vw for m — > 
oo. Then dominated convergence provides the statement, since J^(t,n) — > n 
oo a.e. and the function C is bounded. 
Estimate the second term by 

< \\(^BZ(t,n)>U^ (tn) ~ ^Bl{t,oo)<U^)Jll{t,n)Q V j^ tn) \\ 2 

+ \\'&Bv(t,oo)<U™(Jwl( t i n ) ~ UVW )Q V jf 1 (t,n)h 

i ||-n t tvw / /~\vwl r\vwl\\\ 

+ \\n-Bl(t,oo)<U™U W Jl^n) - V m 

< \\TL-BZ,(t,n)>U™ (tn) - lsj,(t,oo)<C/^||2SUp ||Q m || 2 

+||J^,n)-[/™|| 2 sup||Q m || 2 

m 

+ \\Qjv l (t,n)-Q h 



The first term converges to since the difference is bounded and converges 
a.e. to 0. The second term converges to since the difference is bounded and 
converges a.e. to 0. For the third term notice J^ )1 (t,n) converges a.e. to oo 
and b m := \\Q m - Q\\ 2 -^m-s-oo 0. Argue 

WQjl^n) ~ Q h = ^PUVj^m - V ) I A\vwl\)) 

= \\ h J v wl (t,n)\\l^n® 

Combining the above results we obtain the Theorem. q.e.d. 

We come now to the weak convergence of the processes. For a vector 
t = (ti,t2, ■ ■ ■ ,tk) G T*, k G IN and a real valued function / : T — > M let 
f(t) be the vector (f(ti), f(t 2 ), ■ ■ ■ , f(t k )). A finite dimensional distribution 
of a process X = (X(t)) t€T is the distribution of X(t),t G T*. A process 
X n converges weakly to a process X, if all finite dimensional distributions 
converge, i.e. X n (t) converges in distribution to X(t) for all t G T*. 

Corollary 9 The process (Y n (t)) 0<t <i converges weakly to the process (F(t)) <i<i 
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This is an immediate consequence of Theorem [SJ q.e.d. 

The convergence is stated for the half open interval (0, 1]. We could also 
obtain convergence on [0, 1] by redefining Y n appropriate. However then we 
should be very careful about the recurrence relation. See the remark on left 
or right continuity (on D and E) at the end of the introduction. 

Proposition 10 // l f t ->„ t 6 (0,1], % -f t ->„ x and t^x then 

C(n,l n ,i n ) ^ n C(t,x) 

where the C -functions are given in (Ej) and (jHJ). 

Proof: The recursion (J2]) for n > 2 and 1 < I < n implies a recursion for 
a(n,l) = EX(n,l) in n > 2. The solution is given in (j3J) [8]. The solution 
provides the correct values EX(n,l) for the (n, Z)-tuples (1, 1), (1, 0), (0, 0) 
but not for the tuples (n, 0) for n > 2. Therefore we have to be careful by 
plugging in. 

The asymptotics of the harmonic numbers are 

tf„ = lnn+7+ i___I_ + _H_ + („- 6) 

with 7 = 0, 577215... the Euler constant. We will use H n = lnn + 7 + 6 n with 

K = 0(i). 

For n > 2 and 1 < I — l n , i — i n < n argue 
C(n,l,i) = I + 11 + III 



a(i — 1, i — 1) — a(n, I)) 

a{i — 1, z — 1) + a(n — i,l — i) — a(n, /)) 



im — (2(i - 1) - 6(Z) - 6 - 2n + 6(/) - 6 
n n 

+2i^i_! - 2(z - I + 2)# < _, - 2(n + l)£T n + 2(n - Z + 3)/7 n _ m )) 
2 

l t<x (-l + 2x + lim -(i(ln(i - 1) + 7 + b^) 

n n 

-(z-Z + 2)(ln(z-0+7 + M 







1 


/ = 


= Iki 


-( 






n 


J/ = 


= h=i 


1 

-( 






n 


/// = 


= h>i 


1 

-( 




n 


lim 77 = 


= 










lim / = 




(1 
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-(n + l)(lnn + 7 + 6 n ) + (n - / + 3)(ln(n - / + 1) + 7 + 6 n _, +1 )))) 

2 z — 1 z — / 
= l t< J-l + 2x + \im-(i\n (i - I + 2) In 

n n n n 

71 71 — / -I- 1 

-(n + 1) In - + (n - Z + 3) In — - )) 

n n 

= l t <x{-l + 2x + 2xlnx- 2(x - t) ln(x - t) + 2(1 - t) ln(l - t)) 
lim III = l t>a; (l + lim-(2(z-l)-6(z-2) + 2(n-z)-6(Z-z-l)-2n + 6Z) 

+2(n - i + 1)^ - 2(n - Z + 3)H n ^ l+1 + 2i#i_i - 6^1 

-2(n + l)H n + 2(n - Z + 3)# n _ m )) 
2 

= 1^.(1 + lim - (n - i + 1) In n - i) + 7 + &„-i) 

n n 

-(n - Z + 3)(ln(n - / + 1) + 7 + b n _ l+1 ) + z'(ln i + 7 + 
-(n + l)(lnn + 7 + 6 n ) + (n - Z + 3)(ln(n - / + 1) + 7 + 6„_,+i))) 
= l t>a .(l + 2(1 - x) ln(l -x)- 2(1 - t) ln(l - t) + 2xlnx + 2(1 - t) ln(l 

The statement follows easily. 

q.e.d. 
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